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ABSTRACT 


The  goal  of  a  sequential  decision  problem  is  to  identify  one  object 
selected  from  a  knovm  set  of  objects  by  applying  a  series  of  tests  to  the 
object.   The  outcomes  of  tests  already  performed  can  be  used  to  select  the 
next  test.   Each  test  may  be  applied  at  most  once.   Given  the  a  priori  probabi- 
lities of  the  objects,  the  cost  of  each  test,  and  the  probabilities  of  the 
outcomes  of  each  test  given  the  object  to  vjhlch  it  is  applied,  an  algorithm 
is  given  for  constructing  the  tree  of  tests  v;hich  minimizes  the  average 
cost  of  identifying  an  object.   It  is  assumed  that  the  test  results  do  not 
depend  on  the  order  in  which  the  tests  are  applied.   The  algorithm  uses  a 
branch  and  bound  procedure  to  construct  partial  decision  trees.   A  lov7er 
bound  on  the  cost  of  completing  a  partial  tree  is  taken  as  the  minimum 
cost  of  a  set  of  tests  V7hich  will  provide  information  about  the  objects 
equal  to  their  entropy,  under  the  assum.ption  that  the  tests  are  independent. 
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A  Lower  Bound 

We  are  given  a  set  X  of  objects  and  a  series  of  K  tests.   The  k   test 
has  outcomes  in  the  set  Y  .   We  are  also  given  the  cost  C  of  each  test  and 
the  joint  probability  distribution  p(>:,  y  ,  .  .  .  ,y,  )  that  object  X  occurs 
and  tests  1  to  k  produce  outcomes  y  to  y  • 

The  entropy  H(X)  of  the  set  of  objects  before  any  tests  are  performed 
is  defined  to  be 

Since  each  term  in  the  sum  is  >  0,  it  is  clear  that  the  entropy  is 
zero  if  and  only  if  the  probability  of  some  object  is  equal  to  1.  There- 
fore, if  the  entropy  is  not  zero  after  several  tests  have  been  applied  it 
is  not  possible  that  the  object  has  been  identified.  Similarly,  if  the 
average  decrease  in  entropy  produced  from  several  tests  is  not  at  least 
equal  to  the  initial  entropy  then  it  is  not  possible  for  the  objects  to 
be  identified  in  every  case. 

The  decrease  in  entropy  from  applying  test  Y,  is 

1(X;Y  )  =  H(X)  -  H(xIy  )  =      -V     p(>:,y,  ,  •  •  •  ,y^)  log  p(x) 
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Similarly,  the  decrease  in  entropy  from  applying  tv.'O  tests  Y  and  Y  is 

H(X)  -  HCXJY.Y  )  =  -Z2  p(x,y^,...,y^,)  log  p(x) 

1  J     Y  Y      Y 

"*'2_/     p(x,y  ,...,y  )  log  p(x|y  y  ) 
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=  I(X;YJ  +  I(X;Y^|y^) 


But  I(X;Y.|y.)  <  I(X;Y.) 


since 


EpCy.)p(y.) 


V7hich  using  the  inequality  log  V7  <  (u  -■  1)  becomes 

Y^  pCy^pCyJ 

l(X;Y.|Y^)  -  l(X;Y.)  <  (   2.   p(x,y,....y,)  -J^^^r  )  -  1  =  0 
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(Note  that  equality  occurs  when  pCy.jy.)  ■=  p(y.!)p(y.).) 

We  use  the  above  vrell-knovm  result  to  compute  a  lower  bound  on  the  cost 
of  testing  the  objects.   The  result  says  that  each  test  yields  at  most  an 
entropy  equal  to  that  Which  results  when  it  is  applied  first.  We  give  the 
tests  the  benefit  of  the  doubt  and  compute  these  entropies.   Using  the 
costs  of  the  tests  vje  then  compute  the  cost  per  unit  of  entropy  hopefully 
to  be  received  from  each  test  and  rank  the  tests  in  the  order  of  increasing 
cost  per  unit.   Finally,  using  this  ranking,  we  form  the  lox-?est  cost  set 
of  tests  v.-'hich  could  possibly  yield  entropy  equal  to  that  of  the  set  of 
objects.   A  fractional  part  of  one  test  may  be  included  in  this  set  in  order 
to  get  the  exact  amount  of  entropy  needed.   VJe  then  include  the  sair.e  fraction 
of  its  cost  in  the  cost  of  this  set,  which  is  the  lower  bound  in  question. 

Branch  and  Bound 


Armed  V7ith  this  lower  bound  we  can  employ  a  branch  and  bound  algorithm 
like  that  described  by  Reinwald  and  Soland  (2) .  We  form  a  tree  of  partial 
decision  trees.   The  root  node  of  the  tree  of  partial  decision  trees 
corresponds  to  no  tests  applied.   A  node  is  a  complete  node  of  a  decision 
tree  if  no  tests  remain  which  have  not  been  applied  in  reaching  that  node, 
if  one  object  has  probability  one  at  that  node,  or  if  branches  extend  from 
that  node.   A  node  of  the  tree  of  partial  decision  trees  is  a  terminal  node 
if  the  decision  tree  at  that  node  contains  no  incomplete  nodes.   The  ave- 
rage cost  of  the  decision  tree  of  each  non-terminal  node  is  bounded  by  the 
costs  of  the  tests  applied  so  far  plus  the  lovjer  bounds  on  the  cost  of 
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each  of  the  sub-decision  trees  v^hich  has  xiot  been  completed. 

When  the  cost  of  a  terminal  node  is  less. than  the  lower  bounds  on  all 
the  non-terninal  nodes,  this  terminal  node  represents  the  optimum  tree. 
When  this  is  not  the  case  the  non-terminal  node  V7hich  contains  the  partial 
decision  tree  V7ith  the  lovrest  cost  bound  is  extended  by  adding  a  branch 
for  each  test  not  yet  applied. 

It  is  possible  that  some  uncertainty  vjill  remain  after  all  of  the  tests 
are  applied.   In  this  case,  more  than  one  order  of  applying  the  tests  vill 
yield  the  same  results.   This  situation  can  be  detected  by  evaluatlns  the 
result  of  applying  all  of  the  tests  in  a  given  order.   If  the  reinaining  un- 
certainty is  small  it  may  be  desirable  to  find  the  best  tree  which  reduces 
the  entropy  below  some  threshold. 
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